AIbase

# Edge-side Multimodal

Xinyuan VL 2B
Apache-2.0
Xinyuan-VL-2B is a high-performance multimodal large model for edge-side applications, launched by Cylingo Group. It is fine-tuned from Qwen/Qwen2-VL-2B-Instruct on over 5 million multimodal data samples plus a small amount of pure text data.
Text-to-Image, Transformers, Supports Multiple Languages
Cylingo
94
7
Minicpm Llama3 V 2 5
MiniCPM-V 2.6 is a multimodal large model launched by OpenBMB. It surpasses GPT-4V in single-image, multi-image, and video understanding tasks and supports real-time video understanding on iPad.
Image-to-Text, Transformers, Other
openbmb
31.48k
1,394